Superquantile/C  VaR  Risk  Measures:  Second-Order  Theory1 


R.  Tyrrell  Rockafellar 

Department  of  Mathematics 
University  of  Washington 
rtr@uw.edu 


Johannes  0.  Royset 

Operations  Research  Department 
Naval  Postgraduate  School 
joroyset@nps.edu 


Abstract.  Superquantiles,  which  refer  to  conditional  value-at-risk  (CVaR)  in  the  same  way  that 
quantiles  refer  to  value-at-risk  (VaR),  have  many  advantages  in  the  modeling  of  risk  in  finance  and  en¬ 
gineering.  However,  some  applications  may  benefit  from  a  further  step,  from  superquantiles  to  second- 
order  superquantiles.  Measures  of  risk  based  on  second-order  superquantiles  have  recently  been  explored 
in  some  settings,  but  key  parts  of  the  theory  have  been  lacking:  descriptions  of  the  associated  risk  en¬ 
velopes  and  risk  identifiers.  Those  missing  ingredients  are  supplied  in  this  paper,  and  moreover  not  just 
for  second-order  superquantiles,  but  also  for  a  much  broader  class  of  mixed  superquantile  measures  of 
risk.  Such  dualizing  expressions  facilitate  the  development  of  dual  methods  for  mixed  and  second-order 
superquantile  risk  minimization  as  well  as  superquantile  regression,  a  proposed  second-order  version  of 
quantile  regression. 
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1  Introduction 

The  second-order  version  of  conditional  value-at-risk  that  we  introduced  in  [21],  with  further  explana¬ 
tions  in  [20,  23],  corresponds  to  a  sort  of  smoothing  of  the  cumulative  distribution  function  of  a  random 
variable  but  has  other  key  interpretations  as  well.  Motivated  by  specific  applications  in  risk-averse 
optimization  and  regression  for  physical  systems,  we  develop  it  further  here  with  particular  attention 

1This  material  is  based  upon  work  supported  in  part  by  the  U.  S.  Air  Force  Office  of  Scientific  Research  under  grants 
FA9550-1 1-1-0206  and  F1ATA01194G001  and  DARPA  under  grant  HR0011517798. 
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to  duality.  The  term  “superquantile”  as  an  alternative  to  “conditional  value-at-risk”  [19]  is  employed 
for  this  broad  purpose,  beyond  the  usual  domain  of  finance. 

To  understand  the  second-order  ideas  with  which  we  will  be  occupied  in  this  paper,  some  back¬ 
ground  in  the  first-order  ideas  is  needed,  and  we  begin  briefly  with  that.  The  conditional  value-at-risk 
CVaRQ(V)  of  a  random  variable  X  oriented  to  “loss”  or  “cost,”  at  a  probability  level  a  €  [0,1),  is 
the  expected  value  of  the  a-upper  tail  distribution  of  X  as  defined  in  [24,  25].  When  the  cumulative 
distribution  function  F\  for  X  is  continuous  at  VaRQ(V),  the  value-at-risk  of  X  at  level  a,  this  tail 
distribution  is  simply  the  conditional  distribution  for  X  with  respect  to  the  interval  [VaRa,oo),  but 
otherwise  it  requires  taking  into  account  an  atom  of  probability  at  VaRa(V).  This  distinction  makes 
CVaR  different  from  other  notions  introduced  around  the  same  time,  such  as  “tail- VaR”  [2],  which 
includes  the  entire  probability  atom,  and  “mean  shortfall”  [12],  which  omits  it  (although  the  similar 
term  “expected  shortfall”  has  been  ambiguous  in  this  respect).  Conditional  value-at-risk  can  also  be 
expressed  by  the  formula 

1  f1 

CVaRa(V)  =  - -  /  VaR p(X)d/3 

l  a  j  a 

of  [1],  which  was  adopted  by  Follmer  and  Schied  as  the  definition  of  “average”  value-at-risk  [7]. 2 

These  other  concepts  were  originally  articulated  for  random  variables  oriented  toward  gain,  but  the 
loss  orientation  we  follow  here  has  the  advantage  of  making  the  value-at-risk  VaRa(V)  coincide  with 
the  a- quantile  qx(ot)  familiar  in  statistics: 

VaRQ(V)  =  qx(a)  =  min{.x  €  M\Fx(x)  >  a}. 

This  uniting  of  VaR  with  quantiles  has  further  suggested  a  way  of  exiting  from  finance-driven  termi¬ 
nology  about  risk  for  the  sake  of  applications  outside  of  finance,  namely  by  speaking  of  the  conditional 
value-at-risk  CVaRa(X)  as  the  a- superquantile  of  X  in  the  parallel  notation  qx(a)-  Then  the  integral 
formula  for  CVaRa(V)  becomes 

Qx(a)  =  [  qx(fi)d[F  (1) 

1  -  a  Ja 

With  this  shift  we  have  a  platform  for  displaying  the  second-order  superquantiles  of  [21],  to  be 
denoted  by  qx(a :);  they  are  defined  by 

Qx(a)  =  [  qx{fi)d(F  (2) 

1  -  a  Ja 

There  is  more  to  the  second-order  superquantile  than  just  the  analogy  between  (1)  and  (2),  though,  as 
has  been  laid  out  in  [21]. 

Especially  of  interest  is  a  formula  derived  in  [21]  that  extends  to  superquantiles  and  second-order 
superquantiles  the  basic  connection  between  VaR  and  CVaR  discovered  in  [24,  25].  That  earlier  formula 
asserts,  in  quantile/superquantile  notation,  that 

qx(a)  =  min{c  +  Va(X  -  c)},  qx(u)  =  argminjc  +  Va(X  -  c)},  (3) 

CS-R  C£R 

2They  preferred  “average”  because  “conditional”  could  have  differing  usages.  This  issue  also  adds  motivation  to  our 
passage  to  “superquantiles.” 
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in  terms  of  the  “regret”  functional 

Va(X)  =  — 1 — £[max{0,  A}]  =  [  max{0 ,qx(P)}d/3 

1  —  a  1  —  a  Jq 

(with  the  argmin  being  an  interval  which,  if  not  a  singleton,  has  the  quantile  in  question  as  its  left 
endpoint).  The  second-order  extension  asserts  that 

q\(a)  =  min{c  +  Va(X  -  c)},  qx(oi)  =  argminjc  +  VQ(X  -  c)},  (4) 

cei?  C£R 

where 

Va(X)  =  — * —  [  max{0,  qx(f3)}d/3.  (5) 

1  -  «  Jo 

Achieving  such  a  formula  had  been  one  of  our  main  goals  in  pursuing  second-order  superquantilies, 
because  it  is  deeply  tied  to  generalized  regression.  The  joint  formula  (3)  is  central  to  quantile  regression, 
a  well  known  alternative  to  ordinary  least-squares  regression,  so  the  joint  formula  (4)  indicates  a  possible 
elevation  to  superquantile  regression.  The  double  formula  (4)  was  developed  in  [21]  through  a  technique 
in  which  the  superquantiles  of  X  could  be  interpreted  as  the  quantiles  of  a  “super”  random  variable  X 
associated  with  X. 

Our  primary  aim  in  this  paper  is  to  fill  in  missing  parts  of  the  second-order  theory  concerned 
with  duality.  An  important  ingredient  of  duality  for  any  coherent  measure  of  risk,  including  superquan- 
tile/CVaR,  is  an  expression  of  the  risk  as  a  worst-case  expectation  over  an  associated  class  of  probability 
measures.  This  requires  identifying  the  “risk  envelope”  that  characterizes  that  class.  The  risk  envelope 
for  the  risk  measure  given  by  second-order  superquantiles  has  not  yet  been  fully  understood,  but  we 
will  pin  it  down  here. 

This  pushes  us  naturally  into  wider  terrain  in  observing  that  the  integral  formula  for  the  second- 
order  superquantile  casts  it  as  a  special  “spectral”  measure  of  risk  of  X  in  the  sense  of  Acerbi  [1]. 
Spectral  measures  of  risk,  which  have  also  been  studied  from  various  angles  under  the  heading  of  mixed 
superquantile/CVaR  measures  of  risk  [28,  26],  are  known  to  be  fundamental  for  characterizing  coherent 
measures  of  risk  that  are  law-invariant  [11,  7,  10,  17,  15,  32],  Some  properties  of  risk  envelopes  of 
mixed  superquantile  measures  of  risk  are  known  for  the  setting  with  random  variables  defined  on  a 
nonatomic  probability  space;  see  [9,  Sections  4.5  and  4.6]  and  [32].  We  provide  a  development  for 
arbitrary  probability  spaces  and  give  explicit  formulae  for  risk  envelopes  and  identifiers,  especially  for 
second-order  superquantile  risk  measures.  Moreover,  in  the  broader  framework  of  risk  quadrangles  [26], 
we  provide  a  deeper  understanding  of  the  risk  quadrangles  with  (first-order)  superquantile/CVaR  as  its 
statistics  and  for  the  first  time  state  explicit  expressions  for  risk  envelopes  and  identifiers  in  the  case  of 
(finite)  mixture  of  quantiles  as  the  statistic. 

Although  dualization  of  risk  measures  can  be  carried  out  for  a  variety  of  spaces  of  random  variables 
and  paired  dual  spaces  (see  for  example  [31,  4,  9]),  we  focus  here  on  random  variables  with  finite  second 
moments.  This  excludes  some  applications,  for  example  in  finance,  where  only  the  first  moments  are 
finite.  Still,  in  many  applications  in  engineering  and  generalized  regression,  the  assumption  appears 
tolerable.  A  compelling  reason  in  our  setting  is  that  this  restriction  guarantees  the  finiteness  of  second- 
order  superquantiles.  That  follows  from  their  expression  as  an  integral  of  first-order  superquantiles  and 
the  bounds  derived  for  the  latter  under  such  restriction  in  [23,  Proposition  1],  namely 


E[X]  <  qx{o( )  <  mini  £?[A]  4 —  ^  =,  sup  A 1 

l  V 1  -  «  J 
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where  <r(A )  denotes  standard  deviation  and  the  lower  bound  is  strict  for  nonconstant  X  unless  a  =  0. 
Another  plus  is  that  this  choice  allows  for  random  variables  with  normal  distributions,  whereas  much 
of  the  literature  in  finance  restricts  consideration  to  random  variables  with  essentially  bounded  range. 

In  Section  2  we  lay  the  foundation  for  working  in  this  framework  and  the  measures  of  risk  that  fit 
into  it.  We  proceed  in  Section  3  with  the  central  results  of  duality  concerning  risk  envelopes  and  the 
risk  identifiers  they  associate  with  random  variables.  Section  4  then  applies  the  results  to  optimization 
and  generalized  regression.  An  appendix  collects  some  of  the  technical  details  that  are  needed  along 
the  way. 

2  Risk  Measure  Framework 

For  a  probability  space  (II,  F,  IP),  we  let 

C2  =  £2(D,  A,P)  :=  {X  :  0  ->  M  \  X  ^-measurable,  E[X2]  <  oo} 

be  the  space  of  random  variables  with  finite  second  moment,  where  we  write  integration  with  respect  to 
P  using  the  standard  notation  E[X\  =  fn  X(cu)dP(cv).  We  equip  C2  with  the  standard  norm  || AC ||2  := 
(^[A2])1/2.  As  explained  in  the  introduction,  the  choice  of  C2  ensures,  through  (6),  the  finiteness  of 
the  second-order  superquantiles  qx{of)  we  are  especially  focused  on. 

In  the  following,  we  deal  with  classes  of  measures  of  risk  defined  on  C2 .  Regularity  [26,  22]  provides 
fundamental  properties  for  such  risk  measures.  We  recall  that  a  measure  of  risk  TZ  :  C?  — >•  (— oo,  oo]  is 
regular  if  it  satisfies  the  following  axioms: 

TZ(X)  =  c  for  constant  random  variables  X  =  c, 

7C((1  —  t)X  +  tX')  <  (1  —  t)TZ(X)  +  tIZ(X')  for  all  A,  X'  G  C2  and  r  G  (0, 1)  (convexity), 

{A  G  C2  |  TZ{X)  <  c}  is  closed  for  all  c  G  -2?  (closedness), 

7C(A)  >  E[X]  for  nonconstant  A  G  C2  (averseness), 

which  have  as  a  consequence  that  1Z(X  +  c)  =  1Z(X)  +  c  for  all  cGE.  In  fact  we  will  only  be  working 
here  with  risk  measures  that  in  addition  are  both  positively  homogeneous , 

1Z{tX)  =  tTZ(X)  for  r  >  0,  A  G  £2, 


and  monotonic , 

1Z( A)  <  7 Z(Y)  whenever  X(uS)  <  Y (uj)  for  a.e.  u  G 

In  particular  1Z  is  then  a  coherent  measure  of  risk  in  the  sense  of  [2],  Duality  in  this  case  is  expressed 
by  the  following  correspondence  between  risk  measures  1Z  and  sets  Q  called  their  risk  envelopes .3 

2.1  Proposition  (risk  envelope  duality).  For  a  regular  measure  of  risk  1Z  on  C2  that  is  positively 
homogeneous  and  monotone,  the  relations 

7Z(X)  =  sup  E[XQ\  for  X  G  C2 ,  Q  =  {Q  e  C2  \  E[XQ }  <  1Z{ X)  for  all  X  G  £2}, 

Q&Q 

3The  term  “risk  envelope”  was  introduced  in  2002  in  [27]. 
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give  a  one-to-one  correspondence  between  the  regular  measures  of  risk  TZ  on  C2  that  are  positively 
homogeneous  and  monotonic  and  the  nonempty  closed  convex  subsets  Q  of  C2  that  consist  of  elements 
Q  >  0  with  E[Q\  =  1  and  are  such  that  each  nonzero  X  €  C2  has  E[XQ\  >  0  for  at  least  one  Q  €  Q. 

This  fact,  a  specialization  of  the  general  support  function  correspondence  in  convex  analysis,  is 
a  variant  from  [26]  of  known  results  characterizing  other  classes  of  risk  measures,  starting  with  [2]. 
Important  along  with  the  risk  envelope  Q  associated  with  TZ  are  the  sets 

Qx  =  argma xE[XQ\  for  X  £  £2,  (7) 

QeQ 

which  are  called  the  risk  identifiers  for  the  individual  random  variables  X.4 

The  measures  of  risk  at  the  center  of  our  attention  are  the  first-order  superquantile  measures  TZa 
and  the  second- order  superquantile  measures  TZa  given  by 

TZa(X)  =  qx{ot )  and  TZa(X)  =  qx(a)  for  a  €  [0, 1)  (8) 

in  accordance  with  the  expressions  (1)  for  qx{ot)  and  (2)  for  qx{ot)  in  Section  1.  The  properties  of 
7Za  that  make  it  regular,  positively  homogeneous,  and  monotonic  have  been  known  for  some  time,  and 
those  properties  are  obviously  inherited  by  TZa  through  the  expression  of  qx(ct)  as  an  integral  in  (2). 
In  both  cases,  therefore,  we  are  dealing  with  measure  of  risk  covered  by  the  preceding  theorem.  For 
7Za,  the  risk  envelope  is  known  to  be 

Qa  ■=  {Q  e  C2  |  0  <  Q(u)  <  1/(1  —  a)  a.e.  u  £  O,  E[Q\  =  1},  (9) 

cf.  [27,  26].  For  TZa,  the  specifics  of  the  corresponding  risk  envelope  Qa  will  be  determined  for  the 
first  time  in  Section  3;  some  properties  are  known  from  [9,  Sections  4.5  and  4.6]  and  [32]  under  the 
additional  assumption  that  (O,  IP)  is  nonatomic.  However,  to  accomplish  this  efficiently  and  gain 
other  new  insights  at  the  same  time,  we  will  pass  through  a  broader  class  of  risk  measures  that  we 
call  mixed  superquantile  measures  of  risk.  Such  measures  are  already  known,  having  been  identified  by 
Acerbi  [1]  as  corresponding  broadly  to  “spectral”  measures  in  the  pattern  of  dual  utility  theory.  They 
are  a  key  ingredient  in  the  sup  representation  that  Kusuoka  [11]  has  provided  for  a  class  of  law- invariant 
measures  of  risk,  later  also  studied  in  [16,  32], 

2.2  Definition  (mixed  superquantile  measures  of  risk).  For  a  weighting  measure  A,  namely  a  proba¬ 
bility  measure  on  ([0, 1) ,  i5[0;i) ) ,5  the  associated  mixed  superquantile  measure  of  risk  TZ  :  C2  — *  (—00,00] 
is  given  by 

1Z(X)  :=  [  qx{fi)  d\(/3).  (10) 

Jo 

For  technical  reasons,  we  exclusively  deal  in  this  situation  with  the  completion  of  ([0,  l),H[o,i),  A), 
which,  with  a  slight  abuse  of  notation,  we  denote  by  ([0, 1),  Hr0ji),  A). 

4This  term  was  introduced  in  [28],  although  the  sets  in  question  were  handled  earlier  as  being  the  subdifferentials  of 
convex  analysis  for  the  risk  measure  functionals  in  question. 

5  For  a  set  S  with  a  topology,  let  Bs  be  its  Borel  sigma-algebra. 
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Here,  the  key  aspect  is  that  second-order  superquantile  risk  measures  fit  this  definition  because, 
through  (2),  we  have 

Ha(X)  =  qx(a )  =  f  qx(a)  d\a(/3),  where  A a(S)  :=  m^  n  ^ a '  ^  for  S  <E  H[0,i).6  (11) 

Jo  1  -  a 

As  another  special  case,  if  A  is  concentrated  on  a  finite  number  of  points  in  [0, 1),  say  oq,  a2,  ...,  oik, 
then  simply  77(A)  =  A(ai)gx(ai)  +  •••  +  \(ak)qx{&k)-  A  first-order  superquantile  risk  measure  is 
realized  by  setting  k  =  1. 

Note  in  general  that,  since  A  is  defined  on  i5r0,i) ,  we  exclude  the  possibility  of  a  weighting  measure 
that  places  a  positive  weight  at  a  =  1.  That  case  simply  yields  77(A)  =  oo  whenever  sup  A  =  oo,  and 
it  is  better  treated  separately. 

The  basic  properties  of  a  mixed  superquantile  risk  measure  are  described  by  the  following  result, 
where  certain  parts  are  immediate  from  the  definition.  The  result  extends  previous  results  in  [27,  28], 
which  built  in  turn  on  the  earlier  spectral  representation  of  Acerbi  [1],  by  dealing  with  a  significantly 
relaxed  condition  for  finiteness  and  admitting  the  point  f3  =  0  explicitly. 

2.3  Proposition  (mixed  superquantile  properties).  A  mixed  superquantile  risk  measure  1Z  as  in  (10)  is 
well-defined,  monotonic  and  positively  homogeneous.  It  is  regular  if  A({0})  <  1,  but  lacking  averseness 
if  A({0})  =  1.  Specifically, 

77(A)  >  E[X\  for  all  X  €  C2  and  77(A)  >  E[X]  for  nonconstant  X  unless  A({0})  =  1. 

It  is  finite  on  C2  whenever  the  weighting  measure  A  satisfies 

L  7T=p  “m  <  “ 

and,  regardless  of  the  weighting  measure,  has  IZ(X)  <  oo  whenever  sup  A  <  oo. 

It  has  the  alternative  expression 

77(A)  =  [  qx(/3)<p(/3)d/3,  where  (p(/3)  :=  [  —  dX(a),  f3  €  [0,1]. 

Jo  Jo<a</3  1  ~  a 

The  risk  profile  function  <p  is  right-continuous  and  nondecreasing  on  [0, 1]  with  ip( 0)  =  0  and  satisfies 
Jq(1  —  a)dip(a)  =  1.  Conversely,  any  with  these  properties  arises  from  a  unique  weighting  measure 
A  given  by  d\(a)  =  (1  —  a)dip(a). 

The  proof  of  this  proposition,  similar  in  some  ways  to  that  of  previous  versions  but  containing  new 
parts,  is  provided  in  the  Appendix.  Further  clarification  of  properties  of  mixed  superquantile  measures 
of  risk  has  been  furnished  in  [26,  Mixing  Theorem], 

Next  on  the  agenda  is  applying  this  general  result  to  the  case  in  (11)  that  corresponds  to  second- 
order  superquantile  measures  of  risk. 

eHere,  and  throughout  the  paper,  m  denotes  Lcbesgue  measure. 
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2.4  Theorem  (second-order  superquantile  properties).  Any  second-order  superquantile  risk  measure 
Ka  :  C?  ->  R,  a  €  [0, 1),  is  regular,  monotonic,  and  positively  homogenous,  and  satisfies  for  X  €  C2 


E[X]  <  qx(cv)  =  Ha{X)  <  min  E[X)  + 


2  a(X) 
\/l  —  a' 


sup  X 


with  the  lower  bound  holding  with  strict  inequality  whenever  X  is  nonconstant. 
It  has  the  alternative  expressions 

-i  /» 1  -i  /» 1 

na(X)  =  - -  /  qx(P)  log- - ^ d/3=  qx{P)<Pa(P)d/3, 

I-OtJa  1  -  P  Jo 

with  respect  to  the  risk  profile  function 


7>a(P)  ■= 


_l_logi5«  if  a  <  f3  <  1 
0  if  0  <  [3  <  a. 


Moreover,  ipa  is  a  nondecreasing,  finite  convex  function  on  [0, 1]  with  right-derivative  equal  to  1/(1  — a)2 
as  it  starts  to  grow  from  0  at  ff  =  a. 

Proof.  As  a  special  case  of  Proposition  2.3,  it  follows  automatically  that  IZQ  is  well-defined,  regular, 
monotonic,  positively  homogeneous,  and  bounded  from  below  by  E[X].  From  (6), 


na(x)  < 


1  —  a 


E[X]  + 


v/W 


d/3  =  E[X]  + 


o{X)  f1  1 
1  -  a  J a  \/l  -  P 


dA(/3)  =  E[X]  + 


2  a(X) 

v/1  —  OL 


Obviously,  1Za(X)  <  supX  also  holds. 

The  alternative  expression  follows  after  a  specialization  of  ip  of  Proposition  2.3  for  the  given  choice 
of  weighting  measure  A  =  \a.  Specifically, 


<P(P)  =  I  —^—d\Q(  7)  =  <pa(P) 

j 0<7</3  1  “  7 


fa  1^7  d1  ^  OL<P<\ 

0  if  0  <  p  <  a. 


Since  for  0  <  a  <  b  <  1, 


1 


dp 


log 


1  —  a 
1-6’ 


we  therefore  find  that  the  alternative  expressions  follow. 

The  assertion  about  (pa  being  convex  is  justified  by  its  derivative  being  zero  for  p  €  (0,  a)  and 
1/((1  —  a)(  1  —  P))  for  p  €  ( a ,  1),  with  left-  and  right-derivatives  at  P  =  a  equal  to  0  and  1/(1  —  a)2, 
respectively.  □ 


The  upper  bounds  on  TZa  (see  (6))  and  IZa  (Theorem  2.4),  the  latter  given  here  for  the  first  time, 
are  remarkably  similar.  They  show  that,  although  second-order  superquantile  risks  are  larger  than 
first-order  risks,  the  difference  is  at  most  o(X)/y/ 1  —  a. 
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3  Dualization  Through  Risk  Envelopes 


We  now  turn  to  determining  the  dual  expressions  for  mixed  and  second-order  superquantile  risk  mea¬ 
sures  in  terms  of  the  risk  envelopes  described  in  general  in  Proposition  2.1.  The  risk  envelope  Qa  that 
corresponds  to  the  first-order  superquantile  measure  TZa  in  (8)  has  already  been  indicated  in  (9). 

Another  case  where  the  risk  envelope  is  already  known  is  that  of  a  mixed  superquantile  measure 
TZ  associated  with  a  weighting  measure  A  that  is  concentrated  in  finitely  many  points.  Namely,  if 
TZ  =  X\TZai  +  •  •  •  +  \kTZak,  the  corresponding  risk  envelope  is  Q  =  Ai Qai  +  •  •  •  +  A kQak-  This  follows 
immediately  from  general  principles  of  convex  analysis  and  has  been  recorded  explicitly,  for  instance, 
in  [26].  In  the  case  of  a  nonatomic  probability  space  but  general  mixed  superquantile  risk  measures,  we 
find  properties  of  risk  envelopes  developed  in  [9,  Sections  4.5  and  4.6]  and  [32], 

For  arbitrary  probability  spaces,  the  literature  lacks  explicit  expressions  for  risk  envelopes  of  mixed 
superquantile  risk  measures  coming  from  a  weighting  measure  A  that  is  not  merely  discrete.  These  risk 
envelopes  ought,  by  analogy,  to  be  a  sort  of  “continuous  sum”  or  integral  of  various  sets  Qa  of  the  form 
in  (9),  and  the  contemplation  of  such  an  expression  raises  serious  technical  challenges  in  integration 
theory. 

We  take  on  those  challenges  here,  but  with  some  of  the  technical  background  details  placed  in  the 
Appendix.  Let 


M  :=  \q  :  [0,1)  ->  C? 


Q  ^  (%i),  £>£2)  -measurable,  J 


2  d\(/3)  <  oo 


Observe  that  M.  is  well-defined  because  by  Lemma  A. 5  (the  “A”  points  to  the  Appendix),  the  mapping 
/ 3  i->-  \\q(P)\\2  is  Br0) j ) -measurable  whenever  q  is  (/3r0ii), £>£2) -measurable. 

We  are  now  ready  to  deal  with  the  risk  envelope  of  a  mixed  superquantile  risk  measure  TZ  and  for 
this  purpose  utilize  a  collection  of  random  variables  in  terms  of  (Bochner)  integrals  of  elements  of  AL 
In  the  following,  we  let  1R  =  TR  U  {—00,  00}. 


3.1  Theorem  (risk  envelope  for  mixed  superquantiles).  For  a  mixed  superquantile  measure  of  risk  TZ 
with  associated  weighting  measure  A,  let 7 

Q  =  J  q{/3)  d\(/3),q  G  M,  q(/3)  G  Q/s  for  X-a.e.  /3  G  [0, 1)  j  , 

where  cl  denotes  closure  with  respect  to  the  (strong)  topology  on  C? .  Then  Q  is  nonempty,  convex, 
and  is  the  risk  envelope  for  TZ,  i.e.,  for  any  Ig£], 


Q:=d  Qg£ 


K(X)  =  sup  E[XQ\. 

QeQ 

Moreover,  if  J['  1/y/l  —  a  dX(a )  <  00,  then  Q  is  also  weakly  compact. 
Proof.  Let  X  G  C2  and  /  :  [0, 1)  x  C?  ->  TR  be  defined  by 


f{a,Q) 


— E[XQ }  if  Q  G  Qa 
00  otherwise. 


'We  note  that  Q  resembles  the  Aumann  integral  (see  for  example  [3])  of  the  set-valued  mapping  /3  1— >  Qp. 


In  view  of  Definition  A. 3,  /  is  a  normal  integrand  because  (i)  /  is  (£>[o,i)  <8>  ^£2)-measurable  as  the  sum 
of  the  continuous8  function  —  E[X-\  on  [0, 1)  X  £2  and  an  indicator  function  vanishing  on  the  set 

{{Pi  Q )  G  [0, 1)  x  C2  |  Q  G  Qg}  G  %i)  <8>  2 

and  infinity  elsewhere,  (ii)  f(P,Q)  >  —  E[XQ\  >  — oo  for  [5  G  [0,1)  and  Q  G  £2,  and  (iii)  for  all 
P  e  [0,1),  /(/?,-)  is  lower  semicontinuous  by  the  continuity  of  .E[X-]  on  £2  and  the  closedness  of 
Q/3  c  £2,  and  /(/3,  •)  is  not  identical  to  oo  with  Q  =  1  G  Q/j  furnishing  a  finite  value  f(j3, 1)  =  —E[X\. 
In  view  of  Proposition  A. 6  and  the  fact  that  q  =  1  provides  an  element  of  M  with  j  /(/?,  q(/3 ))  d\(/3)  = 
—E[X]  <  oo,  Proposition  A. 4  applies.  Consequently,  the  interchange  of  integration  and  minimization 
is  permitted  and  we  obtain  that 

7 Z(X)  =  f  sup  E[XQp\  d\(j3)  =  -  [  inf  f{/3,Q)  d\(/3) 

J  QpeQp  J  Q e£2 

=  -  inf  f  f{P,q(P))  d\(/3). 
q&M  J 

We  next  consider  the  interchange  of  integration  with  respect  to  A  and  P.  For  q  G  A4,  it  follows  from 
Lemma  A. 5  that  the  function  (/3,w)  i— )•  | X(uj)q(/3)(oj)\  is  measurable.  By  Tonelli-Fubini’s  Theorem  and 
Cauchy-Schwartz  inequality, 

J  \X(u>)qmu)\d{\  X  F)(/3,u>)  =  J  E[\Xq(P)\]  d\(/3)  <  \\X\\2  j  \\q(P)\\2  d\(P)  <  oo, 
where  the  finiteness  follows  by  the  property  of  q  G  M.  Then  by  Tonelli-Fubini’s  Theorem, 


J  E[Xq(/3 )]  d\(P)  =  E 


X 


q(P)  d\{P) 


Since 

J  dm  =  J  E[xq(p)]  dm 

whenever  q  G  M.  is  such  that  q(/3)  G  Qs  for  A-a.e.  (3  G  [0, 1)  and  f  f(/3,  q(/3 ))  d\((3)  =  oo  otherwise,  we 
find  that 


inf  [  f(P,q(P))  d\{P)  =  mf  (  [  E[-Xq(P)}  dX(P)  +  t(q) 

qeMj  q£M  I J 


=  inf  <  —  E 
q£M  { 


X 


q(P)  dX(p) 


+  t(q) 


where 


4<?) 


0  if  q{(3)  G  Qg  for  A-a.e.  (3  G  [0, 1) 
oo  otherwise. 


Compiling  the  above  results,  we  see  that 


K(X)  =  -  inf 
q&M 


f(P,q{P))  d\(P)  =  sup 

q&M 


E 


X  /  q(P)  dX(P) 


sup  E[XQ\. 
Q&Q 


sHcre  continuity  is  with  respect  to  the  product  topology  of  the  norm-topologies  on  [0, 1)  and  C2 . 
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The  convexity  of  Q  follows  from  the  convexity  of  Qp.  Since  1  £  Q,  Q  is  not  empty.  Under  the 
additional  assumption  that  f  l/\/l  —  a  d\(a)  <  oo,  1Z  is  finite- valued  on  C?  and  even  locally  bounded 
around  the  origin  of  C2  by  Proposition  2.3.  This  local  boundedness  for  a  positively  homogeneous  convex 
function,  as  the  support  function  of  a  set  Q ,  corresponds  to  that  set  being  bounded.  Consequently, 
Q  is  bounded.  Since  Q  is  convex,  weak  closedness  follows  from  strong  closedness  and  therefore  weak 
compactness  is  established.  □ 

For  the  special  case  of  a  second-order  superquantile  risk  measure  we  then  obtain  the  following 
corollary. 


3.2  Corollary  (risk  envelope  for  second-order  superquantiles).  For  a  £  [0, 1),  the  risk  envelope  for  lZa 
is  given  by 


Qa  • —  cl  \  Q  G  £ 


Q  = 


1  —  a 


q((3)d/3,q  £  M.,  q((3)  £  Qp  for  m-a.e.  (3  £  [a,  1) 


Moreover,  Qa  is  a  nonempty  weakly-compact  convex  subset  of  C2. 


In  addition  to  the  trivial  cases  when  A  and/or  P  are  positive  only  on  a  finite  number  of  points  in 
[0, 1)  and  fl,  respectively,  the  closure  in  the  definition  of  Q  is  unnecessary  under  the  following  condition. 


3.3  Proposition  (dispensing  with  the  closure  operation).  Suppose  that  A  is  nonatomic  and  also  that 
fo  1/(1  —  a)  dX(a)  <  oo.  Then  the  closure  operation  is  superfluous  in  the  expression  of  the  envelope  in 
Theorem  3.1.  One  can  simply  take 


Q=  <Q  £  £2 


Q=  q(/3)  dA((3),  q  £  M,  q{/3 )  £  Qp  for  X-a.e.  (3  £  [0, 1) 


Proof.  By  [6],  an  integrably  bounded  £>[0  ^-measurable  set- valued  mapping  S  :  [0, 1 )  ^  C2,  with  closed 
and  convex  values,  satisfies 


cl 


5(a)  dX(a) 


5(a)  dX(a) 


when  A  is  nonatomic.  Take  5  to  be  the  mapping  a  >-)•  {q{ot)  \  q  £  Ai,  q(a )  £  Qa},  which  obviously  is 
closed  and  convex  valued  by  the  properties  of  Qa.  Moreover,  since  both  [0, 1)  and  C?  are  separable, 
there  exists  a  countable  collection  ql  £  M. ,  such  that  5(a)  =  cl{//(a)  |  i  =  1,2, ...}  for  A-a.e. 

a  £  [0, 1).  Thus,  5  is  /3[0,i)"measurable;  see  f°r  example  [18,  Theorem  1],  The  mapping  5  is  integrably 
bounded  if  there  exists  a  B[o, immeasurable  g  ■_  [0, 1)  — >•  ]R  with  f  g(a )  dX(a )  <  oo  and 


sup  HQH2  <  g(a )  for  A-a.e.  a  £  [0, 1). 
QeS(a) 


Since  for  our  choice  of  5  we  have  that  every  Q  £  5(a)  has  Q(u)  <  1/(1  —  a)  for  a.e.  u  £  ST,  integrably 
boundedness  holds  with  g(a)  =  1/(1  —  a)  under  the  imposed  restriction  on  A.  □ 

Next,  we  turn  to  specific  expressions  for  risk  identifiers.  Recall  from  (7)  that  for  any  X  £  C?  and 
positively  homogeneous  regular  measure  of  risk  on  C2 ,  a  Q  in  the  risk  envelope  of  the  risk  measure 
that  maximizes  E  [X Q]  is  called  a  risk  identifier  at  X.  We  again  start  with  the  building  blocks  from 
first-order  superquantile  risk  measures. 
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For  X  £  C? ,  the  set 


Qa  :=  argrna xE[XQ] 

QClQol 

is  convex  and  nonempty  with  its  elements  referred  to  as  risk  identifiers  of  lZa ■  Before  we  characterize 
these  risk  identifiers,  we  introduce  additional  notation. 

For  /3  €  (0, 1),  let 

top(X)  :=  €  ft  |  X(u)=qx(P)} 

and  let 

Fx(x)  :=  lim  Fx(x'),  igfi 

x'  X  X 

be  the  left-continuous  “companion”  of  the  cumulative  distribution  function  Fx,  where  the  limit  exists 
by  the  virtue  of  F\  being  nondecreasing  and  bounded  from  above.  For  F\  continuous,  Fx  =  F^  of 
course. 

The  risk  identifiers  of  lZa  are  then  characterized  as  follows;  see  also  [31,  Equation  4.21]  for  closely 
related  expressions. 


3.4  Proposition  For  X  €  C?  and  f3  €  (0, 1),  let  r $  €  C?  be  such  that 


Fx{qx{P))  -  P 


0  <  r${uj)  <  - - -  for  a.e.  u ;  €  ft  and  /  r%  (u)dP(uj)  =  —  ~V  - 

1  ~  P  Jnp{x)  1  “  P 


X  r * 

Every  such  r-J ,  defines  a  unique9  Qp  ’  p  €  C?  given  for  a.e.  lo  €  ft  by 


X  rx 

Q?  "  M  ==  < 


flfrg  ifX(u)>qx(P) 
rp(u)  if  X{yS)  =  qx(P)  and  P({w})  >  0 
0  otherwise. 


(12) 


(13) 


Then, 


Q  =  Qp'p  for  some  rj  €  C2  satisfying  (12) 


Moreover, 

Q'o  =  {Q  €E  C2  |  Q(u>)  =  1  for  a.e.  uj  G  ft}. 

Proof.  Let  P  €  (0, 1)  and  X  €  C2 .  We  first  show  that  there  exists  an  €  C2  satisfying  (12).  For 
u  €  ft  satisfying  X(cu)  =  qx{P)  and  P({w})  >  0,  F^(X(uj))  <  ft  <  Fx{X(ui)),  with  at  least  one  of  the 
inequalities  being  strict,  and 


FX{X(uj))  —  (3 


(1  -P)(Fx(X(uj))-F-(X(uj))) 


e  [o,  i/(i  -  /?)]. 


9With  C2  consisting  of  equivalence  classes  of  functions  identical  up  to  on  a  set  of  P-measure  zero,  uniqueness  of  course 
is  in  the  sense  of  such  equivalence  classes. 
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Let  fgf  €  C2  be  defined  for  a.e.  u  £  fl  by 

(  Fx(X(  u))-p 


:=  )  (1  ~/3)(Fx(X(U))-F-(X(»))y 

[0  otherwise. 

Clearly,  f-g  satisfies  0  <  Eg  (uj)  <  1/(1  —  /3)  for  a.e.  u  £  Moreover, 


if  X(u>)  =  qx(P)  and  P({<u})  >  0 


IClp(X) 


rf  (c j)dP(w)  = 


Fx(qx{P))  -  P 


-rfP(w)  = 


Fx(qx{P))  -  P 


Inp(X)  (1  -P)(Fx(qx(P))-F-(qx(P))y--  1  ~P 

and  fg  therefore  satisfies  (12). 

X  rx 

Let  r*  £  C 2  satisfy  (12).  Since  0  <  Qg  p  (w)  <  1/(1  —  P)  for  a.e.  u  £  Q  and 

1 


Q^(w)dP(w)  = 


/  1  + 

|  x(oj)>qxm  1  “  P 

1  ~  Fx(qx{P))  ,  Fx(qx{P))  —  P 


rg  (w)dP(w) 


1-/3 


+ 


1-/3 


=  1, 


,X,rf 


we  find  that  Qg  p  £  Qp-  Moreover, 
X  rx 


E 


XQ 


0 


X(v) 


dF(uj)+  X(uj)rg(uj)dF{uj) 

(o;en  I  X{u:)>qx{P)}  1  -  P  Jcip(X) 


1 


1  -  p  h uen  |  X(u)>qxm 
/*oo 

/  X  dF%(x), 


X(uj)dP(u)  +  qx(P) 


Fx(qx(P))  ~  P 
1-/3 


(14) 


F%(x)  :  = 


where 

Fxi  j’gP13  if  Fx(x)  >  P 
v0  if  Fx(x)  <  p. 

It  is  well  known  (see  [25])  that  the  superquantile  qx(P)  =  fXx  dF^(x).  Thus,  we  have  proved  that 

X rx  X rx 

Qg’  p  maximizes  E[X-\  over  Qg.  Any  Q  £  Qg  not  equal  to  Qg  13  for  any  r *  must  necessarily  have 

E[XQ\  <  <i  x  (  3). 

The  case  of  /3  =  0  follows  also  as  then  Qo  =  {Q  £  £2  |  0  <  Q{oj)  <  1  for  a.e.  well,  E[Q\  =  1}.  □ 

A  particular  element  of  Qg  plays  a  central  role  in  the  following.  Let  fg-  £  C2  be  as  defined  in  (14). 
Consequently  by  Proposition  3.4,  Q g  defined  for  a.e.  ix  £  fl  by 


Qp  (w) 


:=  < 


f  jxg  if  X(u)  >  qX{P) 

fg(uj)  if  X(oj)  =  qx(P)  and  P({w})  >  0 
0  otherwise 


(15) 
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is  a  point  in  Q‘J .  Moreover,  let  Qjf  G  C2  be  defined  by  Qq(uj)  =  1  for  a.e.  w£il,  which  therefore  by 
Proposition  3.4  is  a  point  in  Qq  .  The  random  variable  Q *  behaves  continuously  in  (3  in  a  sense  given 
next. 


3.5  Proposition  If  f3u,f3  €  [0, 1)  and  j3v  — »•  (5,  then  for  any  X  G  C2 ,  \\Qp„  —  Qf\\2  —■ >  0. 

Proof.  Let  X  G  C2  and  f *  be  defined  in  (14)  and  (3  G  (0, 1).  Suppose  that  Fx(qx(P))—Fx(qx{P))  >  0. 
We  consider  two  cases. 

First,  suppose  that  f3u  — >  (3,  with  f3u  <  (3  for  all  v,  which  implies  that  (3  G  [Fx  {qx(P))i  Fx{qx(P))]-  If 
f3  G  (Fx(qx(P))1  Fx{qx(f3))\,  then  qx{Pv)  =  qx{P)  for  sufficiently  large  v.  Consequently,  for  sufficiently 
large  u, 


Of  111  =  [  (0  —  0)2dP(u;) 

J  {u\X(u)<qx{P)} 


+  [  -ff(u))2dP(uj)  +  f 

J<TLp(X)  J  {u\X{u)>qx{P)} 


1 

1-/3" 


1 


2 

dP(w). 


When  X(u)  =  qx{Pu)  =  qx{/3), 


r£v(uj) -r£(oj) 


Fx{qx((3))  —  (3U _ Fx(gxm  -  (3 

(1  -  /3" )(Fx(qx(l3))  -  Fx(qX(m  (1  -  /3)(Fx(qx(/3))  ~  Fx(qx(/3))) 


Hence,  all  three  terms  in  the  above  integral  vanish  as  v  — >  oo.  If  (3  =  Fx(qx(f3)),  then  we  only  have 
that  qx(Pu)  Fqx{P)  by  the  left-continuity  of  qx  and  in  fact  qx{Pu)  <  qX(P))  for  all  v.  Consequently, 


\\Q^-Qf\\22  = 
+ 

+ 

+ 


'{u)\X(u)<qx(01')} 


(0  -  0)2dP(w) 


(fpv(u])  —  0)2dP(w) 


l{u)\qx(P1')=X(uj)<qx(P)} 


1 


dF(u ,) 


{u\qxU31')<X(u)=qx(0)}  \I 

1 


1 


w). 


’  W\qx(Pv)<qx{P)<X{u)}  V 1  I  P, 

Of  the  four  integrals,  the  first  and  fourth  ones  obviously  tend  to  zero.  For  the  second  one,  we  see  that 


P(H qx(n  <  X(u>)  =  qxm )  =  Fx(qx(n)  ~  Fx{qX{Pv))  <  Fx(qXm  -  F^(qx(Pv))  0 


by  the  left-continuity  of  F~x  and  consequently  the  integral  also  tends  to  zero.  For  the  third  integral,  we 
find  that  when  X(uj)  =  qX(/3) 


*x  /  \  =  _ Fx(qxiP))  ~  P _ =  Fx  (qx  {P) )  ~  Fx  (qx  (P) )  =  1 

^  ^  (1  -  P)(Fx(qx(P))  ~  Fx{qx{P)))  (1  “  P)(FX(qX(P))  ~  Fx(qx((3)))  1  -  P' 

Consequently,  the  third  integral  also  tends  to  zero. 
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Second,  suppose  that  (3U  — >  (3,  with  pv  >  (3  for  all  v.  If  (3  G  [F^(qx(P)),  Fx{qx{P))),  then 
Qx((3u)  =  qx(P)  for  sufficiently  large  u  and  the  corresponding  argument  for  the  first  case  still  holds.  If 
(3  =  Fx(qx(/3)),  then  we  only  have  that  qx{Pu)  >  qx(P))  for  all  v.  Consequently, 


\\Q~pv  QX  |||  — 
+ 

+ 

+ 


(0  -  0)2dP(w) 


J  {u\X(u>)<qx(/3)} 

f 

I  {u]\qx{P)=X(u>)<qx(P'')} 

f 

I  {u\qx(0)<X(u)=< ?x(/3^)} 


(0-r^(w))2dP(w) 


1 


1 


’{u\qxO 0)<qx(Pv)<x(w)}  \1  Pv  1  P 

The  first  and  fourth  integrals  obviously  tend  to  zero.  For  the  second  one, 


w). 


Fx{qx{(3))  -  f3 


px,u\  = _ rX\qx\P))  -  p _ =  Fx(gx((3))  -  Fx(gx(P)) 

(1  -  P)(Fx(qXm  ~  Fx(qxm)  (1  -  / 3)(Fx(qx(P ))  -  F^{qx{P))) 


=  0 


and  consequently  a  zero  integral.  For  the  third  integral, 

FX(qxm )  -  Pv 


rfv{u)  = 


(1  -  P" )(Fx(qxWv ))  -  F^(qx(Pv)))  1  -  P 


if  qx{Pv)  remains  bounded  away  from  qx(P)  because  then  Fx(Pn  Fx(P)  =  p.  If  qx(Pn  ^  qx(P), 
then  by  the  right-continuity  of  Fx  we  have  that 


P({u;  G  n\  qx((3)  <  X(u)  =  qx(Pv)})  =  Fx{qx{Pv))  ~  F^(qx(Pu))  <  FX{qx{Pv))  ~  Fx(qx(P))  -►  0. 


Consequently,  the  third  integral  also  tends  to  zero. 

The  situation  with  Fx(qx(P))~  Fx  ( Qx(P ))  =  0  follows  with  similar  and  in  fact  simplified  arguments 
as  in  that  case  Fx  is  continuous  at  qx(fi)  and  qx  is  continuous  at  p. 

Finally,  we  consider  the  case  with  p  =  0  and  Pv\  0.  Then, 


\\Qpv  —  O0II2  — 
+ 


’{u\X(u)>qx^)}  V1  ~  P 

{fpv(u)  —  l)2(iP(w)  +  f 
)  J  f< 


-  1  dP(w) 


'<V(x) 


{u)\X(w)<qx{Pv)} 


(0-  l)2dP(u;). 


Since  1/(1  —  Pv)  — >•  1,  the  first  integral  vanishes.  The  last  two  integrals  vanish  since  their  integrands 
are  bounded  and  Fx(qx(Pu))  — >•  0.  □ 

We  are  then  in  a  position  to  characterize  risk  identifiers  of  mixed  superquantile  risk  measures.  For 
X  G  £2,  let 


Qx  :=cUQe£ 


Q  =  J  q(P)  d\(P),  q  G  M,  q(P)  G  QX  for  A-a.e.  P  G  [0, 1) 


(16) 
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3.6  Theorem  (risk  identifiers  for  mixed  superquantiles).  For  X  £  C2 ,  the  set  Qx  is  convex  and 
satisfies  the  following. 


(i)  If  Q  £  Qx ,  then  Q  is  a  risk  identifier  of  IZ  at  X. 

(U)  Iffl/VT  —  j3  d\(/3 )  <  oo,  then  Qx  is  nonempty  and  weakly  compact,  and  Q  £  Qx  whenever  Q 
is  a  risk  identifier  of  IZ  at  X.  Moreover,  Q  :=  f  q(/3)  dX(f3),  where 

q  :  [0, 1)  — >  C2 ,  with  q(/3)  =  QX  (defined  in  (15))  for  all  (3  £  [0, 1), 

is  furnishing  an  element  of  Qx . 


Proof.  We  first  consider  (i).  Let  Q  €  Qx.  There  exists  sequences  {Qv}^=1  C  C2  and  {qu}^=l  C  M 
such  that  || Qv  —  Q\\ 2  — >•  0,  Qv  =  f  qu(/3)  dX(/3),  and  qu(/3 )  G  Qx  for  all  v  and  A-a.e.  (3  G  [0, 1).  Then, 
for  every  v, 


nx)  = 


E[x<rm  dx(/3 )  =  e 


X 


qv{d)  dX(0) 


e[xqu], 


where  the  middle  equality  follows  by  the  same  argument  as  in  the  proof  of  Theorem  3.1.  Since  by  the 
Cauchy- Schwartz  inequality  E[XQU]  — >•  E[XQ\,  we  also  have  that  IZ(X)  =  E[XQ\,  which  establishes 

(i)- 

Next,  we  consider  (ii).  Suppose  that  f  1/yT  —  (3  dX(/3)  <  00.  We  proceed  toward  a  contradiction. 
Suppose  that  Q  £  Q  is  a  risk  identifier  of  IZ  at  X,  but  Q  0  Qx .  Then  there  must  exists  a  q  G  M.  and 
B  G  such  that  q(f3)  G  Qp  for  A-a.e.  (3  G  [0, 1),  A (B)  >  0,  and  q((3)  0  Qx  for  all  (3  £  B.  However, 
this  implies  that  E[Xq(f3)\  <  E[XQx]  for  all  (3  £  B  and  any  Qx  £  Qx .  Consequently,  E[XQ\  <  IZ(X), 
which  is  a  contradiction. 

Since  Q  is  weakly  compact  by  Theorem  3.1,  the  weak  compactness  of  Qx  follows  from  it  being 
a  closed  convex  subset  of  Q.  Finally,  we  show  that  Q  £  Qx .  The  conclusion  follows  when  we  have 
shown  that  q  £  M.  By  Proposition  3.5,  q  is  continuous  and  therefore  (Hr0>i),  H£2)-measurable.  Since 
for  (3  £  (0, 1) 


II Qp  III  — 


'Lefi  |  X(u)>qxm  (!  -I3)' 


rdP(o;)  + 


Iftp(X) 


Fx(qx(f3))  -  (3 


(l-P)(Fx(qXm-F-(qx(l3)))\ 


dP{u) 


+ 


Fx(qx((3))  -  (3 


(1  —  (3)2  '  L(1  -/3)(Fx(qXm-Ex(qXm)\ 


(Fx(qxm-Fx(qx(m 


+ 


(■ Fx(qXm-/3 )S 


1-/3  (1  -  mFxiqxm  -  Fx(qx((3))) 

<  1  1  (1  —  /3){Fx(qx(/3))  —  f3) 

-1-/3  (1  -  mFx(qxm  -  Fx(qx((3))) 


< 


1 


+ 


Fx(qx((3))  ~  Fx(qx(l 3)) 


1-/3  (1  -  (3)(Fx(qXm  -  Fx(qx(l3)))  1  -  P 


and  ||Q5V  H2  =  1,  we  find  that 


dX(/3)  <  V2  J 


dX((3)  <  00. 
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Consequently  q  G  M  and  Q  =  f  q(/3 )  dX((3)  G  Qx,  which  complete  the  proof.  □ 

We  observe  that  when  f  1/y/l  —  (3  dX((3)  =  oo,  there  are  random  variables  X  G  C?  with  77(A)  =  oo. 
In  this  case  it  might  not  be  necessary  to  select  q  in  (16)  with  q(/3)  G  Qx  for  A-a.e.  (5  G  [0, 1)  because 
f  E[Xq((3)\  dXQ3)  might  still  be  infinity.  For  the  special  case  of  a  second-order  superquantile  risk 
measure,  we  directly  obtain  the  following  corollary  without  this  complication. 


3.7  Corollary  For  a  €  [0, 1)  and  X  G  C2 ,  the  set 


Qa  ■=  cl  <Q  G  C 


Q  = 


1 


q(/3)d/3,  q  €  M,  q(/3)  G  QX  for  m-a.e.  /3  G  [a,  1) 


1  —  a 


is  nonempty,  convex,  and  weakly  compact.  Moreover, 

Q  G  Qx  if  and  only  if  Q  is  a  risk  identifier  of  7Za  at  X. 


Further  simplifications  are  possible  in  the  case  of  second-order  superquantile  risk  measures.  As 
usual,  we  interpret  0  times  — oo  as  zero  in  the  following. 

3.8  Theorem  (further  characterization  of  second-order  superquantile  risk  identifiers).  For  X  G  C2 
and  a  G  [0, 1),  Qx  is  the  closure  of  elements  Qx  G  Qa  given,  for  a.e.  u  G  fl,  by 


Qa  M  =  \ 


l 

1 —a 


log  jhfftj  +  fj$  rf(u)dp  I  if  a  <  f(u)  <  1 


0 


if  f(u>)  <  o  <  F(u) 
otherwise, 


where  rx  G  C?  satisfies  (12)  and  F(uj)  :=  Fx(X(u))  and  f(u>)  :  =  Fx(X(ui)). 

The  specific  choice  fx  G  C2  given  in  (14)  results  in  the  risk  identifier  Qx  G  Qx  having,  for  a.e. 
u>'  G  11. 

f  A  log  A” 


Qa  H  =  < 


1— a 
1 

1 —a 

o 


-FH 

1— a 


l°gT^)+l  + 


l-F(w) 

F{u)-f(u) 


1—F{ui) 


F(uj)—a  ,  1— F(a>)  ,  x— r  ^ 

F(u)-f(w)  +  FM-/(G  i0§ 


1-, 

'(u>) 

f(u) 

a 

if  a  <  /( (jS)  =  F(u>)  <  1 

if  a  <  f(oj)  <  F(ui) 

if  f(oj)  <  a  <  F(yj )  and  f(oj)  <  F(u) 

otherwise. 


Proof.  For  ui  G  such  that  a  <  Fx(X(co))  <  1 

f  1 


hmoc, i)  i  x{uj)>qxm  1  _  P 

By  Proposition  3.4, 

1 


d/3  =  [—  log(l  —  /?)] 


F~{X(lo)) 


=  log 


1  —  a 


QaH  = 


1  —  0 
1 

l  —  o 


d/3  + 


/{/9G(a,  1)  |  X{u)>qxm  1  -  P  J{P G(a.l)  |  X{u)=qx  W)} 


l-Fx(X(u)) 
rf(u)d(3 


(17) 


log 


l  —  o 


rFx(X(u)) 


+ 


rf(oj)d/3 


1  -  FX  ( X (w))  Jf~  (X(w)) 
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which  proves  the  first  claim.  The  second  claim  follows  by  a  similar  argument. 

We  next  turn  to  the  specific  choice  of  rx .  For  a  <  F~x  (X(uj))  =  Fx(X(u ))  <  1,  the  conclusion 
follows  trivially.  For  a  <  Fx  (X(oj))  <  Fx(X(u)),  integration  gives  that 


rFx(X(co)) 


rf(u)d/3  = 


f  Fx(X(u)) 


/F~(X(  W)) 


Fx(X(cu))  -  p 


'f-(xH)  (1  -  P)(Fx(X(u))  -  Fx(X(u))) 


d/3 


=  1  + 


1  -Fx(X(cj)) 


log 


1  -Fx(X(u)) 


Fx(X(u))-F-(X(u))  bl-F-(X(uj)y 


and  the  corresponding  conclusion  follows.  The  last  case  follows  by  a  similar  calculation.  □ 

The  situation  is  especially  simple  for  the  following  case. 


3.9  Corollary  Suppose  that  F\  is  continuous  for  X  €  C2  and  a  €  [0, 1).  Then,  Q £  is  a  singleton10 
with  element  given,  for  a.e.  oj  €  by 


Q 


x, 


u;  = 


_ 1 _  lo£f  _ f _ — _ 

1  -a  iU&  1  -Fx(X(u)) 


if  a  <  Fx{X{oj))  <  1 
otherwise. 


It  is  obvious  that  expressions  of  risk  identifiers  provide  alternative  expressions  for  risk  measures. 
Specifically,  for  X  £  C2, 

'R(X)  =  sup  [  X(u)Q{oj)dP{yj)  =  f  X (co)Qx (u)dP(uj) , 

QgqJ  J 

for  any  Qx  €  Qx .  In  the  case  of  the  previous  corollary,  it  is  easy  to  see  that  the  second-order 
superquantile  risk  takes  the  simple  form 


n«{x)  = 


i 


1  —  a 

- TTVZXdFx{x), 


l~aUx{ot)  1  -  Fx(x) 

where  qxip)  =  — oo  for  a  =  0,  which  complements  the  expression  of  Theorem  2.4. 


4  Applications  to  Optimization  and  Regression 

In  applications  arising  in  optimization  under  uncertainty  and  generalized  regression,  one  is  not  only 
interested  in  the  risk  of  a  single  random  variable  X ,  but  rather  of  a  parameterized  family  of  random 
variables  over  which  the  “best”  is  to  be  selected  according  to  some  criterion  and  constraints.  When 
the  criterion  and/or  the  constraints  are  given  in  terms  of  measures  of  risk  applied  to  this  family  of 
random  variables,  we  obtain  optimization  problems  involving  parameterized  risk.  Properties  of  these 
measures  of  risk  as  functions  of  the  parameters  as  well  as  formulae  for  the  functions’  (sub)gradients 
become  central.  In  this  section,  we  discuss  optimization  problems  involving  parameterized  mixed  and 

10Again,  uniqueness  is  up  to  on  a  set  of  P-measure  zero. 
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second-order  superquantile  risk.  In  particular,  we  see  that  risk  identifiers  developed  in  this  paper  are 
central  for  expressions  of  subgradients. 

We  consider  a  family  of  random  variables  Xu  =  g(u,-),  u  €  1?”,  generated  by  the  function  g  : 
Mn  x  fl  — >•  JR.  Consistent  with  the  previous  sections,  we  assume  that  Xu  €  C2  for  all  u  €  Mn  ■  For  a 
weighting  measure  A  and  the  corresponding  mixed  superquantile  risk  measure  1Z,  as  before  given  by 

U{Xu)  =  J  qXu  (/3)  d\(P)t 

we  get  a  function 

f(u):=U(Xu ),  u  €  Mn ,  (18) 

representing  parameterized  risk.  One  might  then  proceed  with  determining  a  u  €  Mn  that 

minimizes  f(u)  over  a  subset  of  Mn 

or,  alternatively,  with  determining  a  «£  Mn  that 

minimizes  some  criterion  function  of  u  subject  to  f(u)  <  0  and  possibly  other  constraints. 

Algorithms  such  as  cutting  plane  and  bundle  methods  for  solving  these  optimization  problems  require 
expressions  for  (sub)  gradients  of  /.  Justification  for  these  approaches  is  provided  by  the  Convexity 
Theorem  of  [26],  which  establishes  that  /  is  convex  whenever  g(-,ut)  is  convex  for  a.e.  oj  €  fl. 

In  the  remainder  of  the  paper,  we  give  expressions  for  subgradients  of  /,  but  refrain  from  discussing 
full  algorithms;  see  for  example  [16,  13,  30]  for  risk  minimization  algorithms  based  on  dual  approaches 
and  [31]  for  related  subgradient  expressions.  However,  we  end  the  paper  with  a  discussion  of  primal 
and  dual  methods  in  the  context  of  superquantile  regression. 

4.1  Subgradients  of  Parameterized  Risk 

We  restrict  the  attention  to  the  case  with  f  \/\J\  —  a  d\{a)  <  oo  which  ensures  the  finiteness  of  1Z  on 
C?  and  also  the  weak  compactness  of  Q.  We  equip  JRn  x  C?  with  the  product  topology  generated  by 
the  norm  topology  on  Mn  and  the  weak  topology  on  C2 .  The  convergence  of  points  in  lRn  X  C2  in  this 
weak  sense  is  denoted  by  — >• w . 

For  notational  convenience,  we  let  h  :  lRn  x  C2  — >•  JR  be  given  by 

h(u,Q)  :=  f  g(u,u>)Q(u})dF(u>).  (19) 


Properties  of  this  function  are  recorded  next.  The  development  here  is  rather  standard  (see  for  example 
[33]),  but  is  included  as  it  explicitly  shows  the  central  role  of  risk  identifiers. 

4.1  Proposition  Consider  h  in  (19)  and  suppose  for  an  open  set  U  C  Mn  that 

(i)  there  exists  an  L  G  C2  such  that 

\g(u,  uS)  —  g(u',uj)\  <  L(oj)\\u  —  ■u/||  for  all  u,vf  G  17  and  a.e.  wCl 
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(ii)  for  every  i  =  1  there  exists  an  fl,;  C  fl,  with  F’jOi }  =  1,  and  an  Li  G  C2  such  that 

dg(u,uj)/dui  exists  for  u  G  U  and  co  G  fij,  and 


dg(u,u) 

dui 


—  ’ —  <  LAuj)\\u  —  u' II  for  all  u,u'  G  U  and  u  £ 

OUi 


(in)  g(y,  -),dg(vl,  ■ )/dui  G  C2  for  some  v,  vl  G  U,  i  =  1, n. 

Then,  h  is  weakly  continuous  on  U  x  C2  and  \7uh  exists  and  is  likewise  weakly  continuous  on  U  x  C2 . 

Proof.  First  we  consider  h,  which  is  well-defined  and  finite  on  U  x  C2  from  assumptions  (i)  and 
(iii).  Suppose  that  ( uu,Qu )  ~^w  (u,Q) ,  with  uv ,u  G  U  and  QU,Q  G  C? .  Then  by  the  triangle  and 
Cauchy-Schwartz  inequalities  and  assumption  (i), 

\h(uv ,  Qu)  —  h(u,  Q)\  <  J[g(uu,n)  -  g(u,n)]Qu(uj)(lF(uj)  +  J  g(u,n)[Qv (n)  —  Q(n)]dF(n) 

<  II g(uv,  •)  -  g(u,  OlhllQHh  +  J  g(u,u)[Qu(m)  -  Q(n)\dF(uj) 

<(F1[L2])1/2||^_U||||Q-||2+  J  g(u,u)[Cn^)-Q(U)]dP(u) 

By  the  Uniform  Boundedness  Principle,  is  bounded  and  the  first  term  therefore  vanishes. 

Since  assumptions  (i)  and  (iii)  imply  that  g(u,u>)  G  C2  for  all  u  G  U,  the  second  term  vanishes  by  the 
weak  convergence  of  Qu  to  Q. 

Second  we  consider  Vu/i.  Following  a  standard  argument  and  the  Dominated  Convergence  Theorem 
(see  for  example  the  proof  of  Theorem  7.44  in  [33]),  we  find  that  for  every  u  G  U  and  Q  G  C2,  Vuh(u ,  Q ) 
exists  and  is  given  by 

Vuh(u,Q )  =  J  Vug(u,n)Q(n)dP(uj). 

Repeating  the  above  argument  with  g  replaced  by  dg/dm  and  assumption  (i)  by  assumption  (ii)  estab¬ 
lishes  the  claim  about  V„/i.  □ 


In  view  of  Proposition  4.1,  the  following  conclusions  is  a  direct  consequence  of  [29,  Theorem  10.31]. 

4.2  Theorem  (subdifferentiability  of /).  Suppose  that  the  assumptions  of  Proposition  4.1  holds.  Then, 
f  in  (18)  is  locally  Lipschitz  continuous  on  U  and  strictly  differentiable 11  where  it  is  differentiable.  There 
exists  a  set  D  C  U  such  that  U  \  D  is  negligible12,  f  is  differentiable  on  D,  and  the  gradient  V/  is 
continuous  relative  to  the  set  D. 

Moreover,  the  directional  derivative  of  f  at  u  G  U  in  direction  v  G  Mn  is 

df(u)  (v)  =  max  j  (E  [Vug(u,  -)Q] ,  v)  Q  G  Q9^  j 

11Recall  that  /  :  Rn  — >  1R  is  strictly  differentiable  at  a  point  x  if  f{x)  is  finite  and  there  is  a  vector  v  £  IRn  such  that 
(f(x')  —  f(x)  —  ( v,x'  —  x))/\x'  —  a;|  — >  0  whenever  x,x'  — »  x  and  x'  ^  x;  see  [29,  Definition  9.17]. 

12A  subset  of  a  set  of  Lebesgue  measure  zero  is  negligible. 
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and  the  subdifferential  of  f  at  u  £  U  is 


df(u )  =  con  [E  [Xug(u,  -)Q]  |  Q  €  Q9^  }  , 
where  Q9(ur)  js  given  in  (16)  with  X  replaced  by  g(u,  •). 


□ 

We  observe  that  when  A  =  Aa,  i.e.,  the  focus  is  on  a  second-order  superquantile  risk  measure  7Za, 
then  Q9(u>')  is  fully  characterized  by  Theorem  3.8.  In  particular,  the  latter  half  of  that  theorem  provides 
a  specific  risk  identifier  Q  £  Qs(.ur)  that  is  easily  calculated  when  fl  has  finite  cardinality.  Such  a  risk 
identifier  then  provides  the  subgradient  E[Xug(u,  ■ )Q ]  of  /,  which  also  is  easily  calculated  in  this  case. 


4.2  Application  to  Superquantile  Regression 

Superquantile  regression  as  laid  out  in  [23]  (see  also  [21]  and  [14],  which  also  includes  several  appli¬ 
cations)  resembles  quantile  regression,  but  instead  of  estimating  conditional  quantiles  focuses  on  con¬ 
ditional  superquantiles;  quantifies  of  substantial  interest  as  demonstrated  by  the  numerous  references 
listed  in  [23].  In  particular,  the  approach  of  estimating  a  weighted  average  of  conditional  quantiles  (see 
for  example  [5])  can  be  viewed  as  an  approximate  approach  to  estimating  conditional  superquantiles 
as  realized  from  (1).  The  approach  given  in  [23]  is  more  direct  and  avoid  approximations  of  this  kind. 
Specifically,  we  find  that  for  Y  £  C2  and  a  €  (0, 1), 

(gy  («)}  =  argmiri £a(Y  —  it0),  where  £a(Y)  :=  Va(Y)  —  E[Y ] 
uo£R 

is  a  measure  of  error  given  in  terms  of  the  measure  of  regret 13  Va  defined  in  (5).  In  the  same  manner  as 
minimizing  mean-squared  error  yields  an  expectation  and  the  foundation  for  least-squares  regression, 
and  minimizing  a  Koenker-Basset  error  yields  a  quantile  and  the  foundation  for  quantile  regression, 
minimizing  £a  leads  to  superquantile  regression. 

Superquantile  regression  deals  with  the  problem  of  approximating  a  random  variable  Y  £  C2  by  a 
combination  of  more  accessible  random  variables  X\ ,  X2, ...,  Xn  £  C2 ,  such  that  the  error  as  quantified 
by  £a  is  minimized.  Hopefully,  the  knowledge  of  X  =  (X\, Xn)  would  then  provide  reasonably 
accurate  predictions  of  Y .  We  stress  that  the  terminology  “regression”  is  here  used  more  broadly  than 
in  the  statistics  literature.  Limiting  the  scope  to  affine  regression  functions,  superquantile  regression 
then  needs  to  solve  the  problem 


min  {Y  -  [u0  +  («,  A)]) 

uo&R;u£R 

to  obtain  regression  coefficients  uq  and  u.  That  is,  the  regression  coefficients  (uq,u)  are  selected  such 
that  the  error  between  Y  and  the  model  uq  +  (u,  X)  is  minimized. 

We  show  in  [23]  that  this  problem  can  be  decomposed  into  the  two  problems 

1  fl 

(i)  find  u  £  argmin  — —  /  qg(Ut.){P)d/3  -  E[g(u,  •)]  and  (ii)  find  u0  =  (a), 

u£Rn  t  ®  J a 

13We  refer  to  [26]  for  a  general  treatment  of  measures  of  error  and  regret. 
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where  for  each  u  G  Mn 


g(ur)  =  Y  -  (u,X) 

is  a  random  variable  defined  on  the  sample  space  Q  =  Mn+1,  with  sigma-algebra  BRn+i,  and  probability 
P  given  by  the  distribution  of  (A',  Y).  The  problem  (i)  is  that  of  minimizing  a  second-order  superquantile 
of  g(u,  •)  minus  the  expectation  of  g(u,  •).  Since  E\g(u,  •)]  =  E[Y]  —  (u,  E[X])  is  a  deterministic  quantity, 
this  problem  is  essentially  in  the  form  discussed  earlier  in  the  section:  to  minimize  a  mixed  superquantile 
risk  measure,  in  fact  a  second-order  superquantile  risk  measure. 

Suppose  that  the  distribution  P  is  supported  on  the  points  {(x-7,  y-?')}J=1  C  IRn+l  with  P{(xJ,  yi)}  = 
p7,  j  =  1, ...,  is,  as  is  the  case  in  practice  when  the  regression  relies  on  the  observed  data  { (x-7 ,  y3 ) }j= j . 
Then,  the  evaluation  at  a  given  u  €  Mn  of  the  objective  function 

f(u)  =  — f  qg(u,.){P)d/3  -  E\g(u,  •)] 

of  problem  (i)  and  a  corresponding  subgradient  are  achieved  as  follows:  Determine  the  cumulative 
distribution  function  of  g{u,  •)  and  use  the  formula  in  the  second  half  of  Theorem  3.8,  with  X  replaced 
by  g(u,  •),  to  determine  a  risk  identifier  Qa  U’  \  This  computation  can  be  obtained  in  0{y  logzz)  time, 
with  sorting  of  {yJ  —  (u,  :Y)}'-=1  to  obtain  the  cumulative  distribution  function  being  the  bottleneck. 
Then,  in  view  of  Theorem  4.2,  the  function  value  and  a  subgradient  are  readily  available  through 

V  V 

/(«)  =  y 3  -  3  _  (u’x3)) 

j= 1  i=1 

and 

V  V 

v f(u)  =  22  —v^x^Q92u’\u3)  +  22/P3 xJ ’  w^ere  u3  =  (x3 > v3 ) ■ 
j=1  j=l 

We  note  that  the  assumptions  of  Proposition  4.1  are  easily  verified  in  this  case  due,  in  part,  to  the 
affine  form  of  g(-,  oj).  Consequently,  each  iteration  of  a  cutting-plane  method  or  bundle  method  requires 
therefore  computational  time  of  order  0(v  log  v)  as  a  function  of  the  number  of  data  points.  The  number 
of  iterations  needed  would  depend  on  the  method,  n  (the  number  of  explanatory  variables),  and  other 
factors.  In  comparison,  a  “primal”  method  proposed  in  [23]  for  solving  the  same  problem  requires  the 
solution  of  a  linear  program  with  n  +  0(u2)  variables  and  0(v2)  inequality  constraints.  It  is  therefore 
clear  that  for  small  n  and  large  u.  which  is  typical  in  regression  problems,  a  dual  method  relying  on 
the  expressions  derived  in  this  paper  might  outperform  the  linear-programming-based  approach;  see 
[14]  for  empirical  evidence  supporting  this  claim.  In  fact,  even  storage  of  the  linear  program  becomes 
challenging  for  large  u. 

A  Appendix 

As  support  for  proving  Proposition  2.3  in  Section  2,  we  need  the  following  consequence  of  the  Fubini- 
Tonelli’s  Theorem. 

A.l  Proposition  Suppose  that  (Y,  A ,  p)  and  (T,  B,  u)  are  sigma-finite  measure  spaces.  If  f  :  X  x  T  — t 
IR  is  measurable  with  respect  to  the  product  sigma-algebra  on  X  x  y  and  g  :  X  x  y  — >•  1R  is  integrable 
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with  respect  to  the  product  measure  p  x  v,  with  f(x,  y)  >  g(x,  y )  for  (p  x  v)-a.e.  (x,  y)  €  X  x  y,  then 
the  following  hold: 


0) 

(») 

(Hi) 


the  function  hi  =  f  f(x,  •)  dp(x)  is  B -measurable, 
the  function  h-2  =  /  f(-,y )  dv{y)  is  A-measurable, 
and 


j  f  d(px  v) 


f(x,y )  dp(x) 


dv{y)  = 


f(x,y)  dv{y) 


dp[x). 


Proof.  We  recall  that  the  integral  of  the  sum  of  a  nonnegative  measurable  function  and  an  integrable 
function  equates  the  sum  of  the  individual  integrals  under  the  usual  rules  for  handling  addition  with 
infinity.  Then, 


hi  = 


f(x,-)dp(x ) 


J  (/  -  g)(x,-)dp(x)  + 


g(x,  -)dp(x) 


is  ^-measurable  since  both  terms  on  the  right-hand  side  are  ^-measurable  by  the  Fubini-Tonelli  The¬ 
orem.  A  similar  argument  yields  the  conclusion  for  h-2-  The  final  assertion  follows  by  applying  the 
Fubini-Tonelli  Theorem  to  /  —  g  and  g ,  and  the  above  rule  about  interchange  of  summation  and  inte¬ 
gration.  □ 


Proof  of  Proposition  2.3.  For  every  X  €  C2,  qx  is  continuous  and  finite  on  [0, 1)  and  therefore 
/T), ^-measurable.  Moreover,  qx  >  E[X\  and  therefore  TZ(X)  >  E[X\  >  — oo.  Consequently,  TZ  is  well- 
defined  with  values  in  [E'[A],oo].  Its  regularity  and  positive  homogeneity  follow  directly  from  those  of 
1ZQ ;  see  [26].  Since  qx  is  strictly  increasing  on  [0,1)  for  nonconstant  X ,  we  have  that  if  A({0})  <  1, 
then 


K(X)  =  £[X]A({0})  +  [  qx(0)  d\{^)  >  £[X]A({0})  +  E[X}(  1  -  A({0})  =  E[X] 

J  1>/J>0 


and  the  strict  lower  bound  follows.  From  (6), 

n{X)  <  E[X]  +  -^2=  dm  =  E[X\  +  o(X)  £  -J±=  dm  <  oo 

under  the  stated  assumption,  which  establishes  the  corresponding  finiteness  on  £2.  In  the  case  of 
sup  A  <  oo,  finiteness  of  TZ(X)  follows  trivially. 

We  next  consider  the  alternative  expression.  By  definition, 


nx)  = 


qx(/3)ip(a,/3)df3 


d\(a ), 


(20) 


with  '(/’(«)/?)  =  if0<a</3<l  and  f3)  =  0  otherwise.  We  equip  [0,1)  X  (0,1)  with 

the  product  measure  A  x  m  defined  on  the  product  sigma-algebra  B\ 0;i)  ®S(0ii).  It  is  obvious  that  ip  : 
[0, 1)  x  (0, 1)  — >•  Ft  is  (Z3[o,i)®f5(o,i))-measurable  and  likewise  qx,  viewed  as  a  function  on  [0, 1)  x  (0, 1)  that 
is  constant  in  its  first  argument,  due  its  monotonicity.  Consequently,  the  function  (a,  (5)  i->-  qx(/3)il>(a,  (5) 
is  measurable  in  the  same  sense.  Then,  we  look  toward  the  interchange  of  integration  order  in  (20). 
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We  consider  three  cases,  (i)  Suppose  that  X  >  0  a.e.  Then,  qx  >  0  and  q; \ip  >  0,  and  the 
interchange  of  integration  order  is  permitted  by  Tonelli-Fubini’s  Theorem,  (ii)  Suppose  that  X  <  0 
a.e.  Then,  —qx  >  0  and  —qx'J’  >  0,  and  the  interchange  of  integration  order  is  again  permitted  by 
Tonelli-Fubini’s  Theorem,  (iii)  Suppose  that  neither  (i)  nor  (ii)  holds.  Then,  there  exists  a  fix  €  (0, 1) 
such  that  qx(P)  >  0  for  (3  >  Px  and  qx(P)  <  0  for  P  <  Px-  In  view  of  Proposition  A.l,  it  suffices  to 
find  an  integrable,  lower-bounding  function  of  qx'J’-  Let  g  :  [0, 1)  x  (0, 1)  — >•  JR  be  given  by 


g{a,f3) 


qx(/3)/(l  -  fix)  if  0  <  a  <  P  <  Px 

<  qx(/3 )  if  0  <  a  <  f3  <  1,  Px  <  P 

0  otherwise. 


Clearly,  qx'1!’  >  g  and 


|g|d(A  x  m)  < 


1 


1  —  Px 


\qx\d{\  x  m)  = 


1 


i  r  r  i 


1  —  fix 


\qx(/3)\d(3 


d\(a), 


where  the  equality  follows  by  Tonelli-Fubini’s  Theorem.  The  inner  integral  simplifies  to 
r  1  /•!  [Px  rPx 

/  \qx(p)\dp=  qx(/3)d/3-  qx(/3)d/3  =  (1  -  Px)qx(Px)  -  /  qx(p)dp. 

Jo  Jpx  Jo  Jo 

The  last  term  requires  further  simplification.  Recall  that  for  a  £  (0, 1), 


[  qx(P)dP  =  --f  q-X(P)dP  = -q-x(l- a). 

Jo  a  Jl-a 


(21) 


Applying  this  result,  the  inner  integral  from  above  simplifies  further  to 

i 

\qx(P)\dP  =  (1  —  (3x)qx(Px )  +  Pxq~x(  1  —  Px)  <  oo. 

Consequently  in  view  of  (21),  g  is  integrable  and  therefore  furnishes  the  necessary  lower-bounding, 
integrable  function  in  Proposition  A.l,  which  completes  part  (iii).  We  are  therefore  permitted  to 
interchange  the  order  of  integration  in  (20)  and  get 


i  r  r  i 


nx)  = 


qx{P)^{a,P)dp 


d\(a)  =  /  qx(P) 


^(a,  ft)  d\(a) 


dp  =  qx(P)<p(P)dP, 


where  the  last  equality  follows  from  the  definition  of  ip. 

The  hnal  assertions  follow  from  recognizing  that  the  Lebesgue-Stieltjes  measure  dip  associated  with 
a  function  p  has  dp(a)  =  dX(a)  for  a  weighting  measure  A  on  [0, 1).  □ 

Now  we  articulate  other  definitions  and  technical  results  required  in  the  paper. 


A. 2  Definition  Let  (T,  A ,  g)  be  a  complete  measure  space,  with  g  sigma-finite,  X  a  separable  reflexive 
Banach  space,  and  A4  a  linear  subspace  of  the  linear  space  of  all  (A,  Bx) -measurable  functions  x  :  T  — * 
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X.  The  set  A4  is  (A,  Bx) -decomposable  if,  whenever  x  €  M.  and  xo  :  S  X  is  a  bounded  (, A,Bx )- 
measurable  function  on  a  set  S  G  A,  with  n(S)  <  oo,  then  the  function  y  :  T  — >•  X  given  by 


also  belongs  to  M. . 


y(t) 


xo  (t)  if  t  E  S 
x(t)  if  t  E  T  \  S 


A. 3  Definition  In  the  notation  of  Definition  A. 2,  we  say  that  a  function  f  :  T  x  X  — >•  (—00,00]  is  a 
normal  integrand  if  the  following  hold: 

(i)  f  is  {A  <8>  13 x) -measurable  and 

(ii)  for  every  t  G  T,  f(t,  •)  is  lower  semicontinuous  on  X  and  not  identical  to  00. 


A. 4  Proposition  Suppose  that  the  conditions  and  notation  of  Definition  A. 2  hold  and  /  iTxA'^ 
(—00, 00]  is  a  normal  integrand.  Then,  the  following  hold: 

(i)  the  functions  t  e-x  inf f(t,0  and  t.  e-x  f(t,x(t)),  with  x  :  T  — x  X  (A,  Bx) -measurable,  are 
A-measurable  and 

(ii)  if  AA  is  (A,  Bx) -decomposable  and  there  exists  an  x  €  M  such  that  f  f(t,x(t))  dp(t)  <  00,  then 

inf  f  f(t,x(t))  dp(t)  =  f  tp(t)  dp(t),  where  tp(t)  =  inf  /(£,£)•  (22) 

xeM  J  J  £ex 


Proof.  First,  we  consider  t  e-x  inf $&*/(£,£)•  F°r  measurable  spaces  (X\,Ai)  and  (A2,A2),  we  recall 
that  a  set-valued  mapping  S  :  X\  X2  is  (Ai,  A2)-measurable  if  its  graph  is  measurable  in  the  sense 
that 

{(xi,x2)  €  X\  X  X2  |  x2  G  5(xi)}  eii®  Aii 

where  A\  ®  A2  is  the  product  sigma-algebra  generated  by  A\  and  A2.  Since  /  is  a  normal  integrand,  the 
set-valued  mapping  t  e-x  epi/(t,  •)  is  A-measurable  and  closed- valued;  see  for  example  [18,  Proposition 
1].  By  [18,  Theorem  1(f)],  there  exists  a  countable  collection  {gi}i£i  of  A-measurable  functions  gi  : 
T  ^  X  X  1R  of  the  form  gilt)  =  (xj(i),  ai(t)),  x,(t)  €  A  and  cq(f)  €  -ff?,  such  that 

epi  fit,  •)  =  cl{ (i) for  all  t  G  T, 

where  cl  denotes  closure.  The  mapping  1 1— >•  Oj(t)  is  also  A-measurable.  Consequently, 

inf  fit,  A  =  inf  oAi)  for  all  i  €  T 
£ex  iei 

and  the  conclusion  follows  from  the  fact  that  the  pointwise  infimum  of  a  countable  collection  of  mea¬ 
surable  functions  is  a  measurable  function. 

Second,  we  consider  t  e-x  /(f,x(t)),  which  is  a  composition  of  /  with  the  measurable  mapping 
t  e-x  (t,  x(f))  and  therefore  measurable. 
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Third,  we  establish  part  (ii)  by  following  the  arguments  in  the  proof  of  Theorem  2  in  [18].  By 
assumption  there  exists  a  function  x\  £  M.  and  a  //-integrable  function  a\  :T  ^  1R  such  that 

f(t,xi(t))  <  a\ (t)  for  every  t  £  T. 

Since  pit.)  <  f(t,x(t))  for  every  function  x  £  M  and  t  £  T  by  definition  and  ip  is  ^.-measurable  by 
part  (i),  the  integral  of  ip  is  well-defined  and  either  finite  or  equals  — oo.  Consequently,  the  inequality 
>  holds  in  (22).  Now,  let  7  €  JR  be  such  that 

J  tp(t)  dp(t)  <  7.  (23) 

We  will  prove  the  existence  of  a  function  x  £  A4  such  that 

J  f(t,x{t))  dp(t)  <  7,  (24) 

thereby  establishing  part  (ii).  From  (23)  and  the  properties  of  (T,A,p),  there  exists  a  /j-integrable 
function  ao  :  T  ^  1R  such  that  <p(t)  <  ao(i)  for  every  t  £  T  and 

J  a0(t )  dp(t)  <  7.  (25) 

We  define  the  set- valued  mapping  S  :  T  A  X  by 

Sit)  =  {£  €  X  |  f{t A)  <  «0 it)}  for  t  £  T. 

Since  the  function  (i,£)  i->-  /(t,  ^)  —  ao(t)  is  (^4  <S>  ^)-measurable,  S  is  also  *4-measurable.  Moreover, 
Sit.)  is  for  each  t  £  T  closed  and  nonempty.  Since  S  is  A- measurable,  there  exists  a  A- measurable 
selection  xq.  i.e. ,  a  M-measurable  function  xq  such  that  xq it)  £  5(f)  for  every  t  £  T;  see  for  example  the 
corollary  of  Theorem  1  in  [18].  Since  (25)  holds,  there  exists  a  measurable  set  To  C  T,  with  //(To)  <  00, 
such  that 

/  aoit)  dpit)  +  /  07 (t)  dpit)  <  7.  (26) 

Jt0  Jt\t0 

By  the  construction  of  S  in  terms  of  ao,  the  measurable  selection  xq  can  be  chosen  to  be  bounded  on 
To.  Let  x  :  T  — >•  X  be  such  that  x(t)  =  xq  it)  for  t  £  Tq  and  x(t)  =  x\ it)  for  t  £  T\Tq.  Then,  x  £  M.  by 
the  assumption  of  decomposability,  and  we  have  that  /(f,.x(t))  <  ao{t)  for  t  £  Tq  and  fit,  x{t))  <  ai(f) 
for  t  £  T  \  To.  From  (26)  we  then  conclude  (24),  which  establishes  part  (ii).  □ 

A. 5  Lemma  If  q  :  [0, 1)  — >  C2  is  (jBr0ii),  £>£2) -measurable,  then 

(i)  the  function  fi  :  [0, 1)  x  fl  -»  1R  given  by  fi(/3,u)  =  g(/3)(w)  is  (-B[o,i)  <8)  F) -measurable,  and 

(ii)  the  function  fi  :  [0, 1)  — >•  1R  given  by  /b (/?)  =  || t/(/5) II2  is  BiQ  ^-measurable. 

Proof.  For  part  (i)  simply  observe  that  f\  =  g  o  h,  where  h  :  [0, 1)  X  Q  -)■  C2  x  f2,  with  h(a,ui)  = 
iqia),  in),  and  g  :  C2  x  fl  — >  1R,  with  giQ,  u)  =  Q(w).  The  conclusion  then  follows  from  the  measurability 
of  q  and  elements  of  C2 ,  and  the  fact  that  composition  of  measurable  functions  is  measurable.  Next, 
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we  consider  part  (ii).  A  trivial  extension  of  part  (i)  establishes  that  the  function  (/3,w)  i->-  [q(/3)( u)]2 
is  (jB[0ii)  <S>  J7)-measurable.  Since  it  is  also  nonnegative,  it  follows  from  Tonelli-Fubini’s  Theorem  that 
[, f-2{-)}2  is  Buy i ^-measurable.  □ 

The  following  is  a  direct  consequence  of  Definition  A. 2. 

A. 6  Proposition  The  set  A4  is  (Hr0)i),  B £2) -decomposable. 
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